Goto

Collaborating Authors

 target point



A Additional experimental details

Neural Information Processing Systems

RBF kernel to increase pretraining data diversity. Architectural details In all experiments, we use the same ExPT architecture. This section details how we constructed new objectives from the original D'Kitty and Ant that we In Ant-Energy, the reward at each time step is: R =1+ Survival reward Control cost Contact cost, (6) which means we incentivize the robot to conserve energy instead of running fast. D'Kitty tasks In D'Kitty, the goal is to design a morphology that allows the D'Kitty robot to reach We found the approximate oracle provided by Design-Bench not accurate enough to provide a reliable comparison of optimization methods on this task. C.1 Effects of GP hyperparameters We empirically examine the impact of two GP hyperparameters, the variance and the length scale ` Specifically, we evaluate the performance of ExPT on D'Kitty We average the performance across 3 seeds.



Supplementary Material for Accurate Interpolation for Scattered Data through Hierarchical Residual Refinement Shizhe Ding

Neural Information Processing Systems

In the embedding phase, NIERT uniformly embeds both observed and target points. A learnable mask vector is introduced for target points lacking value data. The NIERT interpolator's core is a Transformer encoder with a masked self-attention mechanism, uniformly encoding observed and The NIERT, a Transformer encoder-only architecture that uniformly encodes observed points and models their correlations, exhibits superior interpolation accuracy. Our proposed architecture, specifically adapted to HINT's overall framework, introduces HINT employs residuals on observed points to estimate residuals on target points. Table 1: Statistics of the interpolation tasks used for training in each dataset.Dataset d Theoretical dataset II: Perlin is another synthetic assembly of interpolation tasks, specifically designed for the numerical interpolation of two-dimensional rough functions.



ContextDrag: Precise Drag-Based Image Editing via Context-Preserving Token Injection and Position-Consistent Attention

He, Huiguo, Yan, Pengyu, Yi, Ziqi, Zhong, Weizhi, Liu, Zheng, Tang, Yejun, Yang, Huan, Gai, Kun, Li, Guanbin, Jin, Lianwen

arXiv.org Artificial Intelligence

Drag-based image editing aims to modify visual content followed by user-specified drag operations. Despite existing methods having made notable progress, they still fail to fully exploit the contextual information in the reference image, including fine-grained texture details, leading to edits with limited coherence and fidelity. To address this challenge, we introduce ContextDrag, a new paradigm for drag-based editing that leverages the strong contextual modeling capability of editing models, such as FLUX-Kontext. By incorporating VAE-encoded features from the reference image, ContextDrag can leverage rich contextual cues and preserve fine-grained details, without the need for finetuning or inversion. Specifically, ContextDrag introduced a novel Context-preserving Token Injection (CTI) that injects noise-free reference features into their correct destination locations via a Latent-space Reverse Mapping (LRM) algorithm. This strategy enables precise drag control while preserving consistency in both semantics and texture details. Second, ContextDrag adopts a novel Position-Consistent Attention (PCA), which positional re-encodes the reference tokens and applies overlap-aware masking to eliminate interference from irrelevant reference features. Extensive experiments on DragBench-SR and DragBench-DR demonstrate that our approach surpasses all existing SOTA methods. Code will be publicly available.


A Learning-based Control Methodology for Transitioning VTOL UAVs

Lin, Zexin, Zhong, Yebin, Wan, Hanwen, Cheng, Jiu, Sun, Zhenglong, Ji, Xiaoqiang

arXiv.org Artificial Intelligence

Transition control poses a critical challenge in Vertical Take-Off and Landing Unmanned Aerial Vehicle (VTOL UAV) development due to the tilting rotor mechanism, which shifts the center of gravity and thrust direction during transitions. Current control methods' decoupled control of altitude and position leads to significant vibration, and limits interaction consideration and adaptability. In this study, we propose a novel coupled transition control methodology based on reinforcement learning (RL) driven controller. Besides, contrasting to the conventional phase-transition approach, the ST3M method demonstrates a new perspective by treating cruise mode as a special case of hover. We validate the feasibility of applying our method in simulation and real-world environments, demonstrating efficient controller development and migration while accurately controlling UAV position and attitude, exhibiting outstanding trajectory tracking and reduced vibrations during the transition process.


Integrated YOLOP Perception and Lyapunov-based Control for Autonomous Mobile Robot Navigation on Track

Chen, Mo

arXiv.org Artificial Intelligence

In the 1990s, the modern scientific and technological revolution marked by computer technology, microelectronics technology, information technology, network technology, etc., entered a rapid development stage, which became the intrinsic driving force to promote the development of robotics technology, and robotics technology has developed rapidly. Among them, autonomous mobile robots(AMRs) can rely on the sensors they carry to perceive and understand the external environment, make real-time decisions according to the needs of the task, carry out closed-loop control, and operate in an autonomous or semi-autonomous manner. It is a new type of robot with certain self-learning and adaptive ability in known or unknown environment. Navigation is an important problem that needs to be solved for AMRs to realize autonomous control, which refers to the process of mobile robot sensing the environment and its own state through sensors and learning, and realizing the process of pointing to the target autonomous movement in an obstructed environment. Since the first mobile robot, Shakey, was introduced in the 1960s, mobile robot navigation has been receiving a lot of attention due to its comprehensiveness and practicality [1].


A Virtual Mechanical Interaction Layer Enables Resilient Human-to-Robot Object Handovers

Faris, Omar, Tadeja, Sławomir, Forni, Fulvio

arXiv.org Artificial Intelligence

Abstract-- Object handover is a common form of interaction that is widely present in collaborative tasks. However, achieving it efficiently remains a challenge. We address the problem of ensuring resilient robotic actions that can adapt to complex changes in object pose during human-to-robot object handovers. We propose the use of Virtual Model Control to create an interaction layer that controls the robot and adapts to the dynamic changes in the handover process. Additionally, we propose the use of augmented reality to facilitate bidirectional communication between humans and robots during handovers. We assess the performance of our controller in a set of experiments that demonstrate its resilience to various sources of uncertainties, including complex changes to the object's pose during the handover . Finally, we performed a user study with 16 participants to understand human preferences for different robot control profiles and augmented reality visuals in object handovers. Our results showed a general preference for the proposed approach and revealed insights that can guide further development in adapting the interaction with the user . Human-to-robot object handover is a fundamental task that frequently occurs in collaborative manipulation.


Multi-UAV Swarm Obstacle Avoidance Based on Potential Field Optimization

Hu, Yendo, Wu, Yiliang, Chen, Weican

arXiv.org Artificial Intelligence

In multi UAV scenarios,the traditional Artificial Potential Field (APF) method often leads to redundant flight paths and frequent abrupt heading changes due to unreasonable obstacle avoidance path planning,and is highly prone to inter UAV collisions during the obstacle avoidance process.To address these issues,this study proposes a novel hybrid algorithm that combines the improved Multi-Robot Formation Obstacle Avoidance (MRF IAPF) algorithm with an enhanced APF optimized for single UAV path planning.Its core ideas are as follows:first,integrating three types of interaction forces from MRF IAPF obstacle repulsion force,inter UAV interaction force,and target attraction force;second,incorporating a refined single UAV path optimization mechanism,including collision risk assessment and an auxiliary sub goal strategy.When a UAV faces a high collision threat,temporary waypoints are generated to guide obstacle avoidance,ensuring eventual precise arrival at the actual target.Simulation results demonstrate that compared with traditional APF based formation algorithms,the proposed algorithm achieves significant improvements in path length optimization and heading stability,can effectively avoid obstacles and quickly restore the formation configuration,thus verifying its applicability and effectiveness in static environments with unknown obstacles.